Overview
Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 10480 |
| Missing cells | 17402 |
| Missing cells (%) | 9.2% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.4 MiB |
| Average record size in memory | 144.0 B |
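As a cross-check of the table above, the missing-cell percentage and the memory figures can be re-derived from the other numbers in this report. This is a minimal sketch in plain Python: the per-variable missing counts are copied from the per-variable tables further down, and treating total memory as rows × average record size is an assumption, not something the report states.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 18
n_observations = 10480
missing_cells = 17402

# Per-variable missing counts reported in the variable sections of this report.
missing_by_variable = {
    "neighbourhood_group": 10480,
    "price": 4606,
    "last_review": 1097,
    "reviews_per_month": 1097,
    "license": 119,
    "host_name": 3,
}

total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: total memory is roughly rows x average record size (144.0 B).
approx_size_mib = n_observations * 144.0 / 2**20

print(sum(missing_by_variable.values()))  # 17402
print(f"{missing_pct:.1f}%")              # 9.2%
print(f"{approx_size_mib:.1f} MiB")       # 1.4 MiB
```

The six per-variable counts add up exactly to the reported 17402 missing cells, so no other variable contributes missing values.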
Variable types
| Numeric | 11 |
|---|---|
| Text | 3 |
| Unsupported | 1 |
| Categorical | 2 |
| DateTime | 1 |
Alerts
| Alert | Type |
|---|---|
| id is highly overall correlated with number_of_reviews | High correlation |
| latitude is highly overall correlated with neighbourhood | High correlation |
| longitude is highly overall correlated with neighbourhood | High correlation |
| neighbourhood is highly overall correlated with latitude and 1 other field | High correlation |
| number_of_reviews is highly overall correlated with id and 2 other fields | High correlation |
| number_of_reviews_ltm is highly overall correlated with number_of_reviews and 1 other field | High correlation |
| reviews_per_month is highly overall correlated with number_of_reviews and 1 other field | High correlation |
| room_type is highly imbalanced (63.0%) | Imbalance |
| neighbourhood_group has 10480 (100.0%) missing values | Missing |
| price has 4606 (44.0%) missing values | Missing |
| last_review has 1097 (10.5%) missing values | Missing |
| reviews_per_month has 1097 (10.5%) missing values | Missing |
| license has 119 (1.1%) missing values | Missing |
| price is highly skewed (γ1 = 31.54959784) | Skewed |
| minimum_nights is highly skewed (γ1 = 27.30406462) | Skewed |
| id has unique values | Unique |
| neighbourhood_group is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
| number_of_reviews has 1097 (10.5%) zeros | Zeros |
| availability_365 has 3999 (38.2%) zeros | Zeros |
| number_of_reviews_ltm has 3771 (36.0%) zeros | Zeros |
Reproduction
| Analysis started | 2025-12-25 14:08:25.731988 |
|---|---|
| Analysis finished | 2025-12-25 14:08:48.302375 |
| Duration | 22.57 seconds |
| Software version | ydata-profiling v4.17.0 |
| Download configuration | config.json |
Variables
id
Real number (ℝ)
High correlation Unique
| Distinct | 10480 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.925464 × 10¹⁷ |
| Minimum | 27886 |
|---|---|
| Maximum | 1.5062874 × 10¹⁸ |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 82.0 KiB |
Quantile statistics
| Minimum | 27886 |
|---|---|
| 5-th percentile | 3382607.8 |
| Q1 | 26293728 |
| median | 6.8934743 × 10¹⁷ |
| Q3 | 1.1196102 × 10¹⁸ |
| 95-th percentile | 1.4374951 × 10¹⁸ |
| Maximum | 1.5062874 × 10¹⁸ |
| Range | 1.5062874 × 10¹⁸ |
| Interquartile range (IQR) | 1.1196102 × 10¹⁸ |
Descriptive statistics
| Standard deviation | 5.6206749 × 10¹⁷ |
|---|---|
| Coefficient of variation (CV) | 0.94856283 |
| Kurtosis | -1.6378541 |
| Mean | 5.925464 × 10¹⁷ |
| Median Absolute Deviation (MAD) | 6.8934743 × 10¹⁷ |
| Skewness | 0.10927337 |
| Sum | -6.6665121 × 10¹⁸ |
| Variance | 3.1591986 × 10³⁵ |
| Monotonicity | Strictly increasing |
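The negative Sum above cannot be right for a column whose minimum is 27886. One plausible explanation, offered here as an assumption rather than something the report states, is that the sum was accumulated in a signed 64-bit integer and wrapped around. The sketch below rebuilds the sum from the rounded mean and the row count, then applies two's-complement wraparound by hand; `wrap_int64` is a hypothetical helper name.

```python
def wrap_int64(value: int) -> int:
    """Reduce an arbitrary integer to the signed 64-bit range (two's complement)."""
    return (value + 2**63) % 2**64 - 2**63


n_observations = 10480
mean_id = 5.925464e17  # rounded mean copied from the table above

# Rebuild the sum from mean x count: about 6.21e21, far beyond the int64 maximum.
true_sum = int(mean_id * n_observations)
wrapped = wrap_int64(true_sum)

print(true_sum > 2**63 - 1)  # True: the sum does not fit in int64
print(f"{wrapped:.4e}")      # close to the reported -6.6665121e18
```

Because the mean is rounded to seven significant digits, the wrapped value only approximates the reported Sum, but it agrees to about four significant digits, which is consistent with the overflow explanation.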
| Value | Count | Frequency (%) |
| 27886 | 1 | < 0.1% |
| 9.731820618 × 10¹⁷ | 1 | < 0.1% |
| 9.708935011 × 10¹⁷ | 1 | < 0.1% |
| 9.709002993 × 10¹⁷ | 1 | < 0.1% |
| 9.709097119 × 10¹⁷ | 1 | < 0.1% |
| 9.709397502 × 10¹⁷ | 1 | < 0.1% |
| 9.710403051 × 10¹⁷ | 1 | < 0.1% |
| 9.710960194 × 10¹⁷ | 1 | < 0.1% |
| 9.714036585 × 10¹⁷ | 1 | < 0.1% |
| 9.718410991 × 10¹⁷ | 1 | < 0.1% |
| Other values (10470) | 10470 | 99.9% |
| Value | Count | Frequency (%) |
| 27886 | 1 | < 0.1% |
| 28871 | 1 | < 0.1% |
| 29051 | 1 | < 0.1% |
| 44391 | 1 | < 0.1% |
| 48373 | 1 | < 0.1% |
| 49552 | 1 | < 0.1% |
| 50263 | 1 | < 0.1% |
| 50515 | 1 | < 0.1% |
| 50523 | 1 | < 0.1% |
| 53921 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1.506287354 × 10¹⁸ | 1 | < 0.1% |
| 1.505255614 × 10¹⁸ | 1 | < 0.1% |
| 1.5049981 × 10¹⁸ | 1 | < 0.1% |
| 1.504985778 × 10¹⁸ | 1 | < 0.1% |
| 1.503867342 × 10¹⁸ | 1 | < 0.1% |
| 1.503528945 × 10¹⁸ | 1 | < 0.1% |
| 1.50347525 × 10¹⁸ | 1 | < 0.1% |
| 1.503399985 × 10¹⁸ | 1 | < 0.1% |
| 1.503145545 × 10¹⁸ | 1 | < 0.1% |
| 1.502865938 × 10¹⁸ | 1 | < 0.1% |
name
Text
| Distinct | 10186 |
|---|---|
| Distinct (%) | 97.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 82.0 KiB |
Length
| Max length | 66 |
|---|---|
| Median length | 43 |
| Mean length | 37.117653 |
| Min length | 1 |
Unique
| Unique | 10009 |
|---|---|
| Unique (%) | 95.5% |
Sample
| 1st row | Romantic, stylish B&B houseboat in canal district |
|---|---|
| 2nd row | Comfortable double room |
| 3rd row | Comfortable single / double room |
| 4th row | Quiet 2-bedroom Amsterdam city centre apartment |
| 5th row | Cozy family home in Amsterdam South |
| Value | Count | Frequency (%) |
| apartment | 3486 | 5.8% |
| in | 3274 | 5.4% |
| amsterdam | 2509 | 4.2% |
| (empty) | 2125 | 3.5% |
| with | 1623 | 2.7% |
| the | 1048 | 1.7% |
| spacious | 946 | 1.6% |
| garden | 842 | 1.4% |
| appartement | 841 | 1.4% |
| city | 821 | 1.4% |
| Other values (3789) | 42820 | 71.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 50026 | 12.9% |
| e | 35249 | 9.1% |
| t | 31534 | 8.1% |
| a | 29377 | 7.6% |
| r | 24415 | 6.3% |
| n | 22967 | 5.9% |
| o | 20046 | 5.2% |
| i | 19746 | 5.1% |
| m | 16403 | 4.2% |
| s | 12940 | 3.3% |
| Other values (131) | 126290 | 32.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 388993 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 388993 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 388993 | 100.0% |
host_id
Real number (ℝ)
| Distinct | 9201 |
|---|---|
| Distinct (%) | 87.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.3450191 × 10⁸ |
| Minimum | 1662 |
|---|---|
| Maximum | 7.1734696 × 10⁸ |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 82.0 KiB |
Quantile statistics
| Minimum | 1662 |
|---|---|
| 5-th percentile | 2635611.3 |
| Q1 | 12777805 |
| median | 45478430 |
| Q3 | 1.8771963 × 10⁸ |
| 95-th percentile | 5.4900582 × 10⁸ |
| Maximum | 7.1734696 × 10⁸ |
| Range | 7.1734529 × 10⁸ |
| Interquartile range (IQR) | 1.7494182 × 10⁸ |
Descriptive statistics
| Standard deviation | 1.8043587 × 10⁸ |
|---|---|
| Coefficient of variation (CV) | 1.3415116 |
| Kurtosis | 1.3721331 |
| Mean | 1.3450191 × 10⁸ |
| Median Absolute Deviation (MAD) | 39793439 |
| Skewness | 1.5704815 |
| Sum | 1.40958 × 10¹² |
| Variance | 3.2557104 × 10¹⁶ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 39110511 | 35 | 0.3% |
| 203731852 | 23 | 0.2% |
| 364305280 | 23 | 0.2% |
| 198405490 | 18 | 0.2% |
| 488984558 | 15 | 0.1% |
| 14574533 | 15 | 0.1% |
| 143098191 | 15 | 0.1% |
| 241644101 | 13 | 0.1% |
| 237150404 | 13 | 0.1% |
| 408898089 | 13 | 0.1% |
| Other values (9191) | 10297 | 98.3% |
| Value | Count | Frequency (%) |
| 1662 | 1 | < 0.1% |
| 3592 | 1 | < 0.1% |
| 14589 | 1 | < 0.1% |
| 42599 | 1 | < 0.1% |
| 57722 | 1 | < 0.1% |
| 59484 | 2 | < 0.1% |
| 70163 | 1 | < 0.1% |
| 72890 | 1 | < 0.1% |
| 77950 | 1 | < 0.1% |
| 92194 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 717346955 | 1 | < 0.1% |
| 716142178 | 1 | < 0.1% |
| 715849738 | 1 | < 0.1% |
| 715439809 | 1 | < 0.1% |
| 714363048 | 1 | < 0.1% |
| 714007165 | 1 | < 0.1% |
| 711697640 | 1 | < 0.1% |
| 711508049 | 1 | < 0.1% |
| 711168278 | 2 | < 0.1% |
| 711085419 | 1 | < 0.1% |
host_name
Text
| Distinct | 3857 |
|---|---|
| Distinct (%) | 36.8% |
| Missing | 3 |
| Missing (%) | < 0.1% |
| Memory size | 82.0 KiB |
Length
| Max length | 44 |
|---|---|
| Median length | 36 |
| Mean length | 6.5630429 |
| Min length | 1 |
Unique
| Unique | 2482 |
|---|---|
| Unique (%) | 23.7% |
Sample
| 1st row | Flip |
|---|---|
| 2nd row | Edwin |
| 3rd row | Edwin |
| 4th row | Jan |
| 5th row | Vesna & Misha |
| Value | Count | Frequency (%) |
| (empty) | 164 | 1.4% |
| hotel | 121 | 1.0% |
| maria | 75 | 0.6% |
| amsterdam | 68 | 0.6% |
| anna | 67 | 0.6% |
| jan | 62 | 0.5% |
| and | 59 | 0.5% |
| thomas | 58 | 0.5% |
| laura | 56 | 0.5% |
| stephan | 54 | 0.4% |
| Other values (3646) | 11281 | 93.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 7633 | 11.1% |
| a | 7497 | 10.9% |
| i | 5724 | 8.3% |
| n | 5554 | 8.1% |
| r | 4508 | 6.6% |
| o | 3457 | 5.0% |
| l | 3261 | 4.7% |
| t | 2711 | 3.9% |
| s | 2483 | 3.6% |
| u | 1624 | 2.4% |
| Other values (82) | 24309 | 35.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 68761 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 68761 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 68761 | 100.0% |
neighbourhood_group
Unsupported
Missing Rejected Unsupported
| Missing | 10480 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 82.0 KiB |
neighbourhood
Categorical
High correlation
| Distinct | 22 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 82.0 KiB |
| De Baarsjes - Oud-West | 1808 |
|---|---|
| Centrum-West | 1207 |
| De Pijp - Rivierenbuurt | 1199 |
| Centrum-Oost | 923 |
| Westerpark | 736 |
| Other values (17) | 4607 |
Length
| Max length | 38 |
|---|---|
| Median length | 23 |
| Mean length | 15.588359 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Centrum-West |
|---|---|
| 2nd row | Centrum-West |
| 3rd row | Centrum-Oost |
| 4th row | Centrum-Oost |
| 5th row | Buitenveldert - Zuidas |
Common Values
| Value | Count | Frequency (%) |
| De Baarsjes - Oud-West | 1808 | 17.3% |
| Centrum-West | 1207 | 11.5% |
| De Pijp - Rivierenbuurt | 1199 | 11.4% |
| Centrum-Oost | 923 | 8.8% |
| Westerpark | 736 | 7.0% |
| Zuid | 735 | 7.0% |
| Oud-Oost | 654 | 6.2% |
| Bos en Lommer | 547 | 5.2% |
| Oud-Noord | 485 | 4.6% |
| Oostelijk Havengebied - Indische Buurt | 436 | 4.2% |
| Other values (12) | 1750 | 16.7% |
| Value | Count | Frequency (%) |
| (empty) | 4022 | 17.0% |
| de | 3074 | 13.0% |
| oud-west | 1808 | 7.7% |
| baarsjes | 1808 | 7.7% |
| centrum-west | 1207 | 5.1% |
| pijp | 1199 | 5.1% |
| rivierenbuurt | 1199 | 5.1% |
| centrum-oost | 923 | 3.9% |
| westerpark | 736 | 3.1% |
| zuid | 735 | 3.1% |
| Other values (27) | 6920 | 29.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 21081 | 12.9% |
| (space) | 13151 | 8.1% |
| r | 12509 | 7.7% |
| s | 11447 | 7.0% |
| t | 11305 | 6.9% |
| u | 9980 | 6.1% |
| - | 9689 | 5.9% |
| a | 6647 | 4.1% |
| i | 6309 | 3.9% |
| d | 6279 | 3.8% |
| Other values (31) | 54969 | 33.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 163366 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 163366 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 163366 | 100.0% |
latitude
Real number (ℝ)
High correlation
| Distinct | 7583 |
|---|---|
| Distinct (%) | 72.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 52.366679 |
| Minimum | 52.290276 |
|---|---|
| Maximum | 52.42512 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 82.0 KiB |
Quantile statistics
| Minimum | 52.290276 |
|---|---|
| 5-th percentile | 52.342858 |
| Q1 | 52.355694 |
| median | 52.36569 |
| Q3 | 52.37651 |
| 95-th percentile | 52.396064 |
| Maximum | 52.42512 |
| Range | 0.13484378 |
| Interquartile range (IQR) | 0.020816359 |
Descriptive statistics
| Standard deviation | 0.017246466 |
|---|---|
| Coefficient of variation (CV) | 0.00032934046 |
| Kurtosis | 1.8918496 |
| Mean | 52.366679 |
| Median Absolute Deviation (MAD) | 0.010406802 |
| Skewness | 0.0079005003 |
| Sum | 548802.8 |
| Variance | 0.0002974406 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 52.36503983 | 13 | 0.1% |
| 52.3881 | 12 | 0.1% |
| 52.3675023 | 11 | 0.1% |
| 52.3399542 | 10 | 0.1% |
| 52.3679231 | 10 | 0.1% |
| 52.36379 | 10 | 0.1% |
| 52.37272 | 10 | 0.1% |
| 52.3742592 | 10 | 0.1% |
| 52.3643 | 10 | 0.1% |
| 52.35455 | 9 | 0.1% |
| Other values (7573) | 10375 | 99.0% |
| Value | Count | Frequency (%) |
| 52.29027622 | 1 | < 0.1% |
| 52.29122 | 1 | < 0.1% |
| 52.29125 | 1 | < 0.1% |
| 52.29132 | 1 | < 0.1% |
| 52.29158 | 1 | < 0.1% |
| 52.29166 | 1 | < 0.1% |
| 52.29189 | 1 | < 0.1% |
| 52.2921452 | 1 | < 0.1% |
| 52.2924678 | 1 | < 0.1% |
| 52.29247 | 6 | < 0.1% |
| Value | Count | Frequency (%) |
| 52.42512 | 1 | < 0.1% |
| 52.42476 | 1 | < 0.1% |
| 52.42473 | 1 | < 0.1% |
| 52.42467 | 1 | < 0.1% |
| 52.42461 | 1 | < 0.1% |
| 52.4237 | 1 | < 0.1% |
| 52.42344669 | 1 | < 0.1% |
| 52.42321 | 1 | < 0.1% |
| 52.423 | 1 | < 0.1% |
| 52.422673 | 2 | < 0.1% |
longitude
Real number (ℝ)
High correlation
| Distinct | 8616 |
|---|---|
| Distinct (%) | 82.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.8894471 |
| Minimum | 4.75587 |
|---|---|
| Maximum | 5.02815 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 82.0 KiB |
Quantile statistics
| Minimum | 4.75587 |
|---|---|
| 5-th percentile | 4.8444293 |
| Q1 | 4.8646175 |
| median | 4.8875165 |
| Q3 | 4.9086747 |
| 95-th percentile | 4.9456944 |
| Maximum | 5.02815 |
| Range | 0.27228 |
| Interquartile range (IQR) | 0.044057224 |
Descriptive statistics
| Standard deviation | 0.034821211 |
|---|---|
| Coefficient of variation (CV) | 0.0071217074 |
| Kurtosis | 1.1275532 |
| Mean | 4.8894471 |
| Median Absolute Deviation (MAD) | 0.021913535 |
| Skewness | 0.49770836 |
| Sum | 51241.405 |
| Variance | 0.0012125168 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4.91438 | 13 | 0.1% |
| 4.909975052 | 13 | 0.1% |
| 4.8889219 | 11 | 0.1% |
| 4.9241543 | 10 | 0.1% |
| 4.8963624 | 10 | 0.1% |
| 4.8992577 | 10 | 0.1% |
| 4.88867 | 10 | 0.1% |
| 4.86797 | 9 | 0.1% |
| 4.9121 | 8 | 0.1% |
| 4.8961103 | 8 | 0.1% |
| Other values (8606) | 10378 | 99.0% |
| Value | Count | Frequency (%) |
| 4.75587 | 1 | < 0.1% |
| 4.756656699 | 1 | < 0.1% |
| 4.77145 | 1 | < 0.1% |
| 4.77373 | 1 | < 0.1% |
| 4.77483 | 1 | < 0.1% |
| 4.777866924 | 1 | < 0.1% |
| 4.77799 | 1 | < 0.1% |
| 4.777999952 | 1 | < 0.1% |
| 4.7781 | 1 | < 0.1% |
| 4.77847 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 5.02815 | 1 | < 0.1% |
| 5.026668638 | 1 | < 0.1% |
| 5.02485 | 1 | < 0.1% |
| 5.01889 | 1 | < 0.1% |
| 5.01839 | 1 | < 0.1% |
| 5.01813 | 1 | < 0.1% |
| 5.01712 | 1 | < 0.1% |
| 5.01667 | 1 | < 0.1% |
| 5.016412258 | 1 | < 0.1% |
| 5.016377204 | 1 | < 0.1% |
room_type
Categorical
Imbalance
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 82.0 KiB |
| Entire home/apt | 8561 |
|---|---|
| Private room | 1839 |
| Hotel room | 49 |
| Shared room | 31 |
Length
| Max length | 15 |
|---|---|
| Median length | 15 |
| Mean length | 14.438359 |
| Min length | 10 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Private room |
|---|---|
| 2nd row | Private room |
| 3rd row | Private room |
| 4th row | Entire home/apt |
| 5th row | Entire home/apt |
Common Values
| Value | Count | Frequency (%) |
| Entire home/apt | 8561 | 81.7% |
| Private room | 1839 | 17.5% |
| Hotel room | 49 | 0.5% |
| Shared room | 31 | 0.3% |
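The Imbalance alert reports 63.0% for room_type. The counts above reproduce that figure if the score is taken to be one minus the Shannon entropy of the class frequencies divided by its maximum, log(k) for k classes. That formula is an assumption inferred from the numbers, not something this report states, and `imbalance_score` is a hypothetical helper name.

```python
import math


def imbalance_score(counts):
    """Return 1 - H/log(k) for k >= 2 classes: 0 for a uniform split, near 1 when one class dominates."""
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log(p) for p in probs)
    return 1 - entropy / math.log(len(counts))


# Counts copied from the Common Values table above.
room_type_counts = {
    "Entire home/apt": 8561,
    "Private room": 1839,
    "Hotel room": 49,
    "Shared room": 31,
}

score = imbalance_score(list(room_type_counts.values()))
print(f"{score:.1%}")  # 63.0%
```

A perfectly balanced column would score 0%, so 63.0% reflects how strongly "Entire home/apt" dominates the four categories.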
| Value | Count | Frequency (%) |
| entire | 8561 | 40.8% |
| home/apt | 8561 | 40.8% |
| room | 1919 | 9.2% |
| private | 1839 | 8.8% |
| hotel | 49 | 0.2% |
| shared | 31 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 19041 | 12.6% |
| t | 19010 | 12.6% |
| o | 12448 | 8.2% |
| r | 12350 | 8.2% |
| m | 10480 | 6.9% |
| (space) | 10480 | 6.9% |
| a | 10431 | 6.9% |
| i | 10400 | 6.9% |
| h | 8592 | 5.7% |
| p | 8561 | 5.7% |
| Other values (9) | 29521 | 19.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 151314 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 151314 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 151314 | 100.0% |
price
Real number (ℝ)
Missing Skewed
| Distinct | 663 |
|---|---|
| Distinct (%) | 11.3% |
| Missing | 4606 |
| Missing (%) | 44.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 336.78515 |
| Minimum | 35 |
|---|---|
| Maximum | 80018 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 82.0 KiB |
Quantile statistics
| Minimum | 35 |
|---|---|
| 5-th percentile | 93 |
| Q1 | 161 |
| median | 222 |
| Q3 | 314 |
| 95-th percentile | 550 |
| Maximum | 80018 |
| Range | 79983 |
| Interquartile range (IQR) | 153 |
Descriptive statistics
| Standard deviation | 1985.6619 |
|---|---|
| Coefficient of variation (CV) | 5.8959305 |
| Kurtosis | 1096.4667 |
| Mean | 336.78515 |
| Median Absolute Deviation (MAD) | 72 |
| Skewness | 31.549598 |
| Sum | 1978276 |
| Variance | 3942853.1 |
| Monotonicity | Not monotonic |
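Several rows in the price tables are derived from other rows, which gives a quick consistency check on the figures. The sketch below is plain Python using only numbers copied from the tables above; the variable names are illustrative.

```python
# Figures copied from the price tables above.
minimum, maximum = 35, 80018
q1, q3 = 161, 314
mean, std = 336.78515, 1985.6619

value_range = maximum - minimum  # reported Range: 79983
iqr = q3 - q1                    # reported IQR: 153
cv = std / mean                  # reported CV: 5.8959305
variance = std ** 2              # reported Variance: 3942853.1

print(value_range, iqr)          # 79983 153
print(round(cv, 4))              # 5.8959
print(f"{variance:.0f}")         # 3942853
```

The mean (336.79) sits above Q3 (314), which is what a long right tail looks like in summary form and is consistent with the Skewed alert for price.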
| Value | Count | Frequency (%) |
| 225 | 100 | 1.0% |
| 180 | 96 | 0.9% |
| 200 | 91 | 0.9% |
| 250 | 67 | 0.6% |
| 270 | 66 | 0.6% |
| 300 | 64 | 0.6% |
| 162 | 55 | 0.5% |
| 190 | 48 | 0.5% |
| 315 | 46 | 0.4% |
| 198 | 42 | 0.4% |
| Other values (653) | 5199 | 49.6% |
| (Missing) | 4606 | 44.0% |
| Value | Count | Frequency (%) |
| 35 | 1 | < 0.1% |
| 39 | 1 | < 0.1% |
| 43 | 1 | < 0.1% |
| 46 | 1 | < 0.1% |
| 49 | 2 | < 0.1% |
| 50 | 1 | < 0.1% |
| 51 | 1 | < 0.1% |
| 53 | 3 | < 0.1% |
| 56 | 4 | < 0.1% |
| 57 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 80018 | 2 | < 0.1% |
| 50000 | 2 | < 0.1% |
| 40000 | 3 | < 0.1% |
| 13978 | 1 | < 0.1% |
| 11000 | 1 | < 0.1% |
| 10000 | 1 | < 0.1% |
| 9999 | 1 | < 0.1% |
| 6474 | 1 | < 0.1% |
| 5200 | 2 | < 0.1% |
| 4500 | 1 | < 0.1% |
minimum_nights
Real number (ℝ)
Skewed
| Distinct | 55 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.3902672 |
| Minimum | 1 |
|---|---|
| Maximum | 1001 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 82.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 7 |
| Maximum | 1001 |
| Range | 1000 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 19.80735 |
|---|---|
| Coefficient of variation (CV) | 4.5116503 |
| Kurtosis | 1009.9475 |
| Mean | 4.3902672 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 27.304065 |
| Sum | 46010 |
| Variance | 392.33112 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 2961 | 28.3% |
| 2 | 2958 | 28.2% |
| 1 | 1822 | 17.4% |
| 4 | 1109 | 10.6% |
| 5 | 731 | 7.0% |
| 7 | 294 | 2.8% |
| 6 | 179 | 1.7% |
| 14 | 65 | 0.6% |
| 10 | 63 | 0.6% |
| 30 | 43 | 0.4% |
| Other values (45) | 255 | 2.4% |
| Value | Count | Frequency (%) |
| 1 | 1822 | 17.4% |
| 2 | 2958 | 28.2% |
| 3 | 2961 | 28.3% |
| 4 | 1109 | 10.6% |
| 5 | 731 | 7.0% |
| 6 | 179 | 1.7% |
| 7 | 294 | 2.8% |
| 8 | 27 | 0.3% |
| 9 | 13 | 0.1% |
| 10 | 63 | 0.6% |
| Value | Count | Frequency (%) |
| 1001 | 1 | < 0.1% |
| 800 | 1 | < 0.1% |
| 365 | 3 | < 0.1% |
| 364 | 4 | < 0.1% |
| 363 | 5 | < 0.1% |
| 360 | 1 | < 0.1% |
| 300 | 2 | < 0.1% |
| 299 | 1 | < 0.1% |
| 210 | 1 | < 0.1% |
| 180 | 2 | < 0.1% |
number_of_reviews
Real number (ℝ)
High correlation Zeros
| Distinct | 573 |
|---|---|
| Distinct (%) | 5.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 47.813359 |
| Minimum | 0 |
|---|---|
| Maximum | 5097 |
| Zeros | 1097 |
| Zeros (%) | 10.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 82.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 3 |
| median | 10 |
| Q3 | 30 |
| 95-th percentile | 264 |
| Maximum | 5097 |
| Range | 5097 |
| Interquartile range (IQR) | 27 |
Descriptive statistics
| Standard deviation | 131.50744 |
|---|---|
| Coefficient of variation (CV) | 2.750433 |
| Kurtosis | 312.19347 |
| Mean | 47.813359 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | 11.734389 |
| Sum | 501084 |
| Variance | 17294.207 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1097 | 10.5% |
| 1 | 641 | 6.1% |
| 2 | 605 | 5.8% |
| 3 | 541 | 5.2% |
| 4 | 495 | 4.7% |
| 5 | 450 | 4.3% |
| 6 | 393 | 3.8% |
| 7 | 355 | 3.4% |
| 8 | 293 | 2.8% |
| 9 | 282 | 2.7% |
| Other values (563) | 5328 | 50.8% |
| Value | Count | Frequency (%) |
| 0 | 1097 | 10.5% |
| 1 | 641 | 6.1% |
| 2 | 605 | 5.8% |
| 3 | 541 | 5.2% |
| 4 | 495 | 4.7% |
| 5 | 450 | 4.3% |
| 6 | 393 | 3.8% |
| 7 | 355 | 3.4% |
| 8 | 293 | 2.8% |
| 9 | 282 | 2.7% |
| Value | Count | Frequency (%) |
| 5097 | 1 | < 0.1% |
| 3726 | 1 | < 0.1% |
| 3187 | 1 | < 0.1% |
| 1879 | 1 | < 0.1% |
| 1477 | 1 | < 0.1% |
| 1445 | 1 | < 0.1% |
| 1133 | 1 | < 0.1% |
| 1095 | 1 | < 0.1% |
| 1080 | 1 | < 0.1% |
| 1065 | 1 | < 0.1% |
last_review
Date
Missing
| Distinct | 1403 |
|---|---|
| Distinct (%) | 15.0% |
| Missing | 1097 |
| Missing (%) | 10.5% |
| Memory size | 82.0 KiB |
| Minimum | 2014-01-04 00:00:00 |
|---|---|
| Maximum | 2025-09-11 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
reviews_per_month
Real number (ℝ)
High correlation Missing
| Distinct | 687 |
|---|---|
| Distinct (%) | 7.3% |
| Missing | 1097 |
| Missing (%) | 10.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.9986678 |
| Minimum | 0.01 |
|---|---|
| Maximum | 99.42 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 82.0 KiB |
Quantile statistics
| Minimum | 0.01 |
|---|---|
| 5-th percentile | 0.06 |
| Q1 | 0.2 |
| median | 0.41 |
| Q3 | 0.91 |
| 95-th percentile | 4.02 |
| Maximum | 99.42 |
| Range | 99.41 |
| Interquartile range (IQR) | 0.71 |
Descriptive statistics
| Standard deviation | 2.3061429 |
|---|---|
| Coefficient of variation (CV) | 2.3092193 |
| Kurtosis | 521.3215 |
| Mean | 0.9986678 |
| Median Absolute Deviation (MAD) | 0.26 |
| Skewness | 16.982169 |
| Sum | 9370.5 |
| Variance | 5.3182952 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.08 | 166 | 1.6% |
| 0.24 | 166 | 1.6% |
| 0.12 | 161 | 1.5% |
| 0.15 | 149 | 1.4% |
| 0.14 | 146 | 1.4% |
| 0.13 | 145 | 1.4% |
| 0.11 | 143 | 1.4% |
| 0.19 | 133 | 1.3% |
| 0.18 | 133 | 1.3% |
| 0.17 | 131 | 1.2% |
| Other values (677) | 7910 | 75.5% |
| (Missing) | 1097 | 10.5% |
| Value | Count | Frequency (%) |
| 0.01 | 27 | 0.3% |
| 0.02 | 49 | 0.5% |
| 0.03 | 96 | 0.9% |
| 0.04 | 107 | 1.0% |
| 0.05 | 95 | 0.9% |
| 0.06 | 126 | 1.2% |
| 0.07 | 127 | 1.2% |
| 0.08 | 166 | 1.6% |
| 0.09 | 130 | 1.2% |
| 0.1 | 116 | 1.1% |
| Value | Count | Frequency (%) |
| 99.42 | 1 | < 0.1% |
| 67.87 | 1 | < 0.1% |
| 51.91 | 1 | < 0.1% |
| 47.05 | 1 | < 0.1% |
| 44.82 | 1 | < 0.1% |
| 42.52 | 1 | < 0.1% |
| 36.6 | 1 | < 0.1% |
| 36.5 | 1 | < 0.1% |
| 33.33 | 1 | < 0.1% |
| 28.09 | 1 | < 0.1% |
calculated_host_listings_count
Real number (ℝ)
| Distinct | 17 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.844084 |
| Minimum | 1 |
|---|---|
| Maximum | 35 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 82.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 6 |
| Maximum | 35 |
| Range | 34 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 3.1590959 |
|---|---|
| Coefficient of variation (CV) | 1.7130976 |
| Kurtosis | 50.72472 |
| Mean | 1.844084 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.3606455 |
| Sum | 19326 |
| Variance | 9.9798867 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 8623 | 82.3% |
| 2 | 750 | 7.2% |
| 3 | 231 | 2.2% |
| 4 | 156 | 1.5% |
| 5 | 125 | 1.2% |
| 6 | 108 | 1.0% |
| 7 | 70 | 0.7% |
| 9 | 54 | 0.5% |
| 13 | 52 | 0.5% |
| 10 | 50 | 0.5% |
| Other values (7) | 261 | 2.5% |
| Value | Count | Frequency (%) |
| 1 | 8623 | 82.3% |
| 2 | 750 | 7.2% |
| 3 | 231 | 2.2% |
| 4 | 156 | 1.5% |
| 5 | 125 | 1.2% |
| 6 | 108 | 1.0% |
| 7 | 70 | 0.7% |
| 8 | 48 | 0.5% |
| 9 | 54 | 0.5% |
| 10 | 50 | 0.5% |
| Value | Count | Frequency (%) |
| 35 | 35 | 0.3% |
| 23 | 46 | 0.4% |
| 18 | 18 | 0.2% |
| 15 | 45 | 0.4% |
| 13 | 52 | 0.5% |
| 12 | 36 | 0.3% |
| 11 | 33 | 0.3% |
| 10 | 50 | 0.5% |
| 9 | 54 | 0.5% |
| 8 | 48 | 0.5% |
availability_365
Real number (ℝ)
Zeros
| Distinct | 366 |
|---|---|
| Distinct (%) | 3.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 93.999809 |
| Minimum | 0 |
|---|---|
| Maximum | 365 |
| Zeros | 3999 |
| Zeros (%) | 38.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 82.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 20 |
| Q3 | 173 |
| 95-th percentile | 347 |
| Maximum | 365 |
| Range | 365 |
| Interquartile range (IQR) | 173 |
Descriptive statistics
| Standard deviation | 122.27616 |
|---|---|
| Coefficient of variation (CV) | 1.3008128 |
| Kurtosis | -0.49198934 |
| Mean | 93.999809 |
| Median Absolute Deviation (MAD) | 20 |
| Skewness | 1.0189258 |
| Sum | 985118 |
| Variance | 14951.459 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 3999 | 38.2% |
| 2 | 108 | 1.0% |
| 3 | 103 | 1.0% |
| 1 | 101 | 1.0% |
| 8 | 97 | 0.9% |
| 253 | 94 | 0.9% |
| 4 | 82 | 0.8% |
| 5 | 80 | 0.8% |
| 9 | 77 | 0.7% |
| 10 | 67 | 0.6% |
| Other values (356) | 5672 | 54.1% |
| Value | Count | Frequency (%) |
| 0 | 3999 | 38.2% |
| 1 | 101 | 1.0% |
| 2 | 108 | 1.0% |
| 3 | 103 | 1.0% |
| 4 | 82 | 0.8% |
| 5 | 80 | 0.8% |
| 6 | 55 | 0.5% |
| 7 | 66 | 0.6% |
| 8 | 97 | 0.9% |
| 9 | 77 | 0.7% |
| Value | Count | Frequency (%) |
| 365 | 47 | 0.4% |
| 364 | 47 | 0.4% |
| 363 | 39 | 0.4% |
| 362 | 44 | 0.4% |
| 361 | 24 | 0.2% |
| 360 | 16 | 0.2% |
| 359 | 21 | 0.2% |
| 358 | 50 | 0.5% |
| 357 | 16 | 0.2% |
| 356 | 25 | 0.2% |
number_of_reviews_ltm
Real number (ℝ)
High correlation Zeros
| Distinct | 149 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.5880725 |
| Minimum | 0 |
|---|---|
| Maximum | 949 |
| Zeros | 3771 |
| Zeros (%) | 36.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 82.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 2 |
| Q3 | 6 |
| 95-th percentile | 47 |
| Maximum | 949 |
| Range | 949 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 25.195305 |
|---|---|
| Coefficient of variation (CV) | 2.9337555 |
| Kurtosis | 406.84532 |
| Mean | 8.5880725 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 15.012953 |
| Sum | 90003 |
| Variance | 634.80339 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 3771 | 36.0% |
| 1 | 1082 | 10.3% |
| 2 | 947 | 9.0% |
| 3 | 763 | 7.3% |
| 4 | 614 | 5.9% |
| 5 | 470 | 4.5% |
| 6 | 390 | 3.7% |
| 7 | 281 | 2.7% |
| 8 | 229 | 2.2% |
| 9 | 156 | 1.5% |
| Other values (139) | 1777 | 17.0% |
| Value | Count | Frequency (%) |
| 0 | 3771 | 36.0% |
| 1 | 1082 | 10.3% |
| 2 | 947 | 9.0% |
| 3 | 763 | 7.3% |
| 4 | 614 | 5.9% |
| 5 | 470 | 4.5% |
| 6 | 390 | 3.7% |
| 7 | 281 | 2.7% |
| 8 | 229 | 2.2% |
| 9 | 156 | 1.5% |
| Value | Count | Frequency (%) |
| 949 | 1 | < 0.1% |
| 813 | 1 | < 0.1% |
| 665 | 1 | < 0.1% |
| 598 | 1 | < 0.1% |
| 552 | 1 | < 0.1% |
| 485 | 1 | < 0.1% |
| 374 | 1 | < 0.1% |
| 359 | 1 | < 0.1% |
| 325 | 1 | < 0.1% |
| 275 | 1 | < 0.1% |
license
Text
Missing
| Distinct | 9080 |
|---|---|
| Distinct (%) | 87.6% |
| Missing | 119 |
| Missing (%) | 1.1% |
| Memory size | 82.0 KiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 22.246115 |
| Min length | 6 |
Unique
| Unique | 8702 |
|---|---|
| Unique (%) | 84.0% |
Sample
| 1st row | 0363 974D 4986 7411 88D8 |
|---|---|
| 2nd row | 0363 607B EA74 0BD8 2F6F |
| 3rd row | 0363 607B EA74 0BD8 2F6F |
| 4th row | 0363 E76E F06A C1DD 172C |
| 5th row | 0363 4A2B A6AD 0196 F684 |
| Value | Count | Frequency (%) |
| 0363 | 8474 | 19.0% |
| exempt | 778 | 1.7% |
| abcd | 21 | < 0.1% |
| ab12 | 18 | < 0.1% |
| 1234 | 17 | < 0.1% |
| 0000 | 16 | < 0.1% |
| 790e | 16 | < 0.1% |
| 78ad | 15 | < 0.1% |
| 8875 | 15 | < 0.1% |
| 3c05 | 15 | < 0.1% |
| Other values (26590) | 35140 | 78.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 34164 | 14.8% |
| 3 | 28732 | 12.5% |
| 0 | 19338 | 8.4% |
| 6 | 18949 | 8.2% |
| E | 10239 | 4.4% |
| 9 | 9836 | 4.3% |
| 1 | 9711 | 4.2% |
| C | 9627 | 4.2% |
| 8 | 9611 | 4.2% |
| 5 | 9581 | 4.2% |
| Other values (43) | 70704 | 30.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 230492 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 230492 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 230492 | 100.0% |
Interactions
Correlations
| | availability_365 | calculated_host_listings_count | host_id | id | latitude | longitude | minimum_nights | neighbourhood | number_of_reviews | number_of_reviews_ltm | price | reviews_per_month | room_type |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| availability_365 | 1.000 | 0.269 | 0.149 | 0.171 | -0.008 | 0.029 | -0.184 | 0.064 | 0.100 | 0.373 | 0.120 | 0.408 | 0.171 |
| calculated_host_listings_count | 0.269 | 1.000 | 0.222 | 0.040 | 0.037 | 0.059 | -0.289 | 0.116 | 0.150 | 0.198 | -0.144 | 0.273 | 0.266 |
| host_id | 0.149 | 0.222 | 1.000 | 0.362 | -0.047 | 0.016 | -0.225 | 0.066 | -0.138 | 0.061 | 0.014 | 0.142 | 0.141 |
| id | 0.171 | 0.040 | 0.362 | 1.000 | -0.019 | -0.018 | -0.111 | 0.045 | -0.635 | -0.044 | 0.102 | 0.133 | 0.114 |
| latitude | -0.008 | 0.037 | -0.047 | -0.019 | 1.000 | -0.077 | -0.027 | 0.682 | 0.060 | 0.064 | -0.068 | 0.074 | 0.109 |
| longitude | 0.029 | 0.059 | 0.016 | -0.018 | -0.077 | 1.000 | 0.001 | 0.669 | 0.032 | 0.024 | -0.036 | 0.040 | 0.093 |
| minimum_nights | -0.184 | -0.289 | -0.225 | -0.111 | -0.027 | 0.001 | 1.000 | 0.051 | -0.147 | -0.239 | 0.081 | -0.344 | 0.000 |
| neighbourhood | 0.064 | 0.116 | 0.066 | 0.045 | 0.682 | 0.669 | 0.051 | 1.000 | 0.075 | 0.056 | 0.133 | 0.061 | 0.177 |
| number_of_reviews | 0.100 | 0.150 | -0.138 | -0.635 | 0.060 | 0.032 | -0.147 | 0.075 | 1.000 | 0.613 | -0.213 | 0.590 | 0.148 |
| number_of_reviews_ltm | 0.373 | 0.198 | 0.061 | -0.044 | 0.064 | 0.024 | -0.239 | 0.056 | 0.613 | 1.000 | -0.229 | 0.788 | 0.129 |
| price | 0.120 | -0.144 | 0.014 | 0.102 | -0.068 | -0.036 | 0.081 | 0.133 | -0.213 | -0.229 | 1.000 | -0.301 | 0.207 |
| reviews_per_month | 0.408 | 0.273 | 0.142 | 0.133 | 0.074 | 0.040 | -0.344 | 0.061 | 0.590 | 0.788 | -0.301 | 1.000 | 0.125 |
| room_type | 0.171 | 0.266 | 0.141 | 0.114 | 0.109 | 0.093 | 0.000 | 0.177 | 0.148 | 0.129 | 0.207 | 0.125 | 1.000 |
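For numeric pairs such as number_of_reviews and reviews_per_month, a rank-based coefficient like Spearman's is a common choice, and ydata-profiling's automatic mode is understood to use it for numeric-numeric pairs; treat that as an assumption rather than a confirmed detail of this report. The sketch below computes Spearman's coefficient as the Pearson correlation of average ranks, using only the first ten rows of the Sample table. The result therefore differs from the 0.590 reported above, which covers the full dataset. All function names are hypothetical.

```python
from math import sqrt

# number_of_reviews and reviews_per_month from the first ten sample rows.
number_of_reviews = [311, 732, 849, 42, 5, 609, 177, 20, 563, 12]
reviews_per_month = [1.87, 3.99, 4.81, 0.23, 0.19, 3.36, 0.97, 0.15, 3.15, 0.14]


def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def spearman(x, y):
    """Spearman's rho: Pearson correlation applied to the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


rho = spearman(number_of_reviews, reviews_per_month)
print(f"Spearman rho on ten sample rows: {rho:.3f}")
```

Working on ranks makes the coefficient insensitive to the heavy skew flagged for columns such as price and minimum_nights, since only the ordering of values matters.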
Missing values
Sample
| | id | name | host_id | host_name | neighbourhood_group | neighbourhood | latitude | longitude | room_type | price | minimum_nights | number_of_reviews | last_review | reviews_per_month | calculated_host_listings_count | availability_365 | number_of_reviews_ltm | license |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 27886 | Romantic, stylish B&B houseboat in canal district | 97647 | Flip | NaN | Centrum-West | 52.387610 | 4.891880 | Private room | 132.0 | 3 | 311 | 2025-09-07 | 1.87 | 1 | 17 | 33 | 0363 974D 4986 7411 88D8 |
| 1 | 28871 | Comfortable double room | 124245 | Edwin | NaN | Centrum-West | 52.367750 | 4.890920 | Private room | 89.0 | 2 | 732 | 2025-09-07 | 3.99 | 2 | 126 | 93 | 0363 607B EA74 0BD8 2F6F |
| 2 | 29051 | Comfortable single / double room | 124245 | Edwin | NaN | Centrum-Oost | 52.365840 | 4.891110 | Private room | 61.0 | 2 | 849 | 2025-09-08 | 4.81 | 2 | 95 | 86 | 0363 607B EA74 0BD8 2F6F |
| 3 | 44391 | Quiet 2-bedroom Amsterdam city centre apartment | 194779 | Jan | NaN | Centrum-Oost | 52.371680 | 4.914710 | Entire home/apt | NaN | 3 | 42 | 2022-08-20 | 0.23 | 1 | 0 | 0 | 0363 E76E F06A C1DD 172C |
| 4 | 48373 | Cozy family home in Amsterdam South | 220434 | Vesna & Misha | NaN | Buitenveldert - Zuidas | 52.327808 | 4.876800 | Entire home/apt | NaN | 3 | 5 | 2024-04-28 | 0.19 | 1 | 0 | 0 | 0363 4A2B A6AD 0196 F684 |
| 5 | 49552 | Multatuli Luxury Guest Suite in top location | 225987 | Joanna & MP | NaN | Centrum-West | 52.380280 | 4.890890 | Entire home/apt | 322.0 | 3 | 609 | 2025-08-26 | 3.36 | 1 | 223 | 53 | 0363 576A D827 5085 6B83 |
| 6 | 50263 | Central de Lux 2 bedrooms (4p) apt 125 sqm | 230246 | Donald | NaN | Centrum-Oost | 52.369378 | 4.929579 | Entire home/apt | 457.0 | 2 | 177 | 2025-09-01 | 0.97 | 1 | 354 | 11 | 0363 7F3D 0BAE 28C8 C7D2 |
| 7 | 50515 | Family Home (No drugs, smoking or parties) | 231864 | Karin | NaN | Bos en Lommer | 52.375590 | 4.838570 | Entire home/apt | 198.0 | 7 | 20 | 2025-08-23 | 0.15 | 1 | 244 | 3 | 0363 5DDB E495 A6D5 CEC6 |
| 8 | 50523 | B & B de 9 Straatjes (city center) | 231946 | Raymond | NaN | Centrum-West | 52.369590 | 4.884230 | Entire home/apt | 162.0 | 2 | 563 | 2025-08-24 | 3.15 | 1 | 261 | 79 | 0363 22DC 0E52 B70B 0FB8 |
| 9 | 53921 | Amsterdam Stylish Lakeview Apartment | 252245 | Ingrid | NaN | IJburg - Zeeburgereiland | 52.355590 | 5.003200 | Entire home/apt | NaN | 1 | 12 | 2024-05-25 | 0.14 | 1 | 0 | 0 | 0363 B43C B1D4 2666 3739 |
| | id | name | host_id | host_name | neighbourhood_group | neighbourhood | latitude | longitude | room_type | price | minimum_nights | number_of_reviews | last_review | reviews_per_month | calculated_host_listings_count | availability_365 | number_of_reviews_ltm | license |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 10470 | 1502865938487978016 | Cosy one bedroom apartment in Amsterdam Noord | 102449031 | Diosa | NaN | Oud-Noord | 52.399030 | 4.915410 | Entire home/apt | 137.0 | 1 | 0 | NaN | NaN | 1 | 17 | 0 | 0363 2AB1 E13E D42F 386D |
| 10471 | 1503145545400026458 | Downtown apartment w/terrasse and view to a canal | 22534485 | Hamilton | NaN | Centrum-Oost | 52.361827 | 4.905296 | Private room | 87.0 | 1 | 0 | NaN | NaN | 1 | 87 | 0 | NaN |
| 10472 | 1503399985423712696 | The Staal House | 420845707 | Hank | NaN | Centrum-Oost | 52.368156 | 4.898271 | Entire home/apt | 500.0 | 1 | 0 | NaN | NaN | 1 | 70 | 0 | 0363 7302 7B95 4CF8 8169 |
| 10473 | 1503475250056511603 | Comfortable spacious apartment in great location | 175885486 | Christian | NaN | De Baarsjes - Oud-West | 52.368652 | 4.855999 | Entire home/apt | 149.0 | 2 | 0 | NaN | NaN | 1 | 291 | 0 | 0363 AA29 4900 1AD4 76A9 |
| 10474 | 1503528945274992542 | Pearl of the Cuyp \| Modern Comfort in De Pijp | 717346955 | Dennis | NaN | De Pijp - Rivierenbuurt | 52.354230 | 4.895250 | Entire home/apt | 423.0 | 2 | 0 | NaN | NaN | 1 | 352 | 0 | 0363 033B 7FC4 5C3A DB9F |
| 10475 | 1503867342263201504 | test host, don't book | 78127165 | Kaiying | NaN | Centrum-Oost | 52.359900 | 4.905820 | Entire home/apt | 6474.0 | 1 | 0 | NaN | NaN | 1 | 365 | 0 | NaN |
| 10476 | 1504985777531398085 | Bright studio with canal view | 613779 | Silvana | NaN | Bos en Lommer | 52.377930 | 4.846690 | Entire home/apt | 130.0 | 1 | 0 | NaN | NaN | 4 | 176 | 0 | 0363F7E548AEB29F3BA3 |
| 10477 | 1504998100462399057 | Bright & Spacious Luxury Corner Apartment | 715849738 | Jason | NaN | Westerpark | 52.373780 | 4.871858 | Entire home/apt | 499.0 | 5 | 0 | NaN | NaN | 1 | 20 | 0 | 0363 5876 BBB2 EF1F 097D |
| 10478 | 1505255613607359391 | Bright and Spacious Ground Floor App. with Garden | 31681093 | Jerom | NaN | Oud-Noord | 52.386550 | 4.917210 | Entire home/apt | 144.0 | 1 | 0 | NaN | NaN | 1 | 27 | 0 | 0363 3146 D0B7 73A7 E9FA |
| 10479 | 1506287353709120640 | Stylish & cozy apartment in Amsterdam West | 32442604 | Lisa | NaN | Bos en Lommer | 52.376299 | 4.860642 | Entire home/apt | 152.0 | 2 | 0 | NaN | NaN | 1 | 26 | 0 | 0363 044C 2949 EABC E0B8 |